Learning to Extract Genic Interactions Using Gleaner

نویسندگان

  • Mark Goadrich
  • Louis Oliphant
  • Jude Shavlik
چکیده

We explore here the application of Gleaner, an Inductive Logic Programming approach to learning in highly-skewed domains, to the Learning Language in Logic 2005 biomedical information-extraction challenge task. We create and describe a large number of background knowledge predicates suited for this task. We find that Gleaner outperforms standard Aleph theories with respect to recall and that additional linguistic background knowledge improves recall.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Relation Extraction in the Biological Domain Using Simple Rule Generation and Pairwise Classification to Extract Genic Interactions from Bibliographical Databases

This paper presents two Relation Extraction (RE) systems designed to identify genic interactions in unstructured text. The systems are trained, tested, and evaluated using the data sets from the Learning Language in Logic (LLL05) Genic Interaction Extraction Challenge. The first system uses the (LP )2 algorithm as implemented in the information extraction program Amilcare [Ciravegna, 2003] to i...

متن کامل

Learning ontological rules to extract multiple relations of genic interactions from text

INTRODUCTION Information extraction (IE) systems have been proposed in recent years to extract genic interactions from bibliographical resources. They are limited to single interaction relations, and have to face a trade-off between recall and precision, by focusing either on specific interactions (for precision), or general and unspecified interactions of biological entities (for recall). Yet,...

متن کامل

Learning Ensembles of First-Order Clauses for Recall-Precision Curves: A Case Study in Biomedical Information Extraction

Many domains in the field of Inductive Logic Programming (ILP) involve highly unbalanced data. Our research has focused on Information Extraction (IE), a task that typically involves many more negative examples than positive examples. IE is the process of finding facts in unstructured text, such as biomedical journals, and putting those facts in an organized system. In particular, we have focus...

متن کامل

Competition of phytoplankton under fluctuating light.

Light is an essential resource for phytoplankton and fluctuates on a wide range of timescales. To understand how light fluctuations affect phytoplankton community structure and diversity, we have studied a set of simple models using a combination of analytical and numerical techniques. Light fluctuations can affect community structure when species exhibit the gleaner-opportunist trade-off betwe...

متن کامل

Genic Interaction Extraction with Semantic and Syntactic Chains

This paper describes the system that we submitted to the “Learning Language in Logic” Challenge of extracting directed genic interactions from sentences in Medline abstracts. The system uses Markov Logic, a framework that combines log-linear models and First Order Logic, to create a set of weighted clauses which can classify pairs of gene named entities as genic interactions. These clauses are ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005